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Abstract 

This paper studies convergence properties of multivariate distribu¬ 
tions constructed by endowing empirical margins with a copula. This 
setting includes Latin Hypercube Sampling with dependence, also 
known as the Iman-Conover method. The primary question addressed 
here is the convergence of the component sum, which is relevant to risk 
aggregation in insurance and finance. This paper shows that a CLT 
for the aggregated risk distribution is not available, so that the un¬ 
derlying mathematical problem goes beyond classic functional CLTs 
for empirical copulas. This issue is relevant to Monte-Carlo based risk 
aggregation in all multivariate models generated by plugging empir¬ 
ical margins into a copula. Instead of a functional CLT, this paper 
establishes strong uniform consistency of the estimated sum distribu¬ 
tion function and provides a sufficient criterion for the convergence 
rate in probability. These convergence results hold for all 

copulas with bounded densities. Examples with unbounded densities 
include bivariate Clayton and Gauss copulas. The convergence results 
are not specific to the component sum and hold also for any other com¬ 
ponentwise non-decreasing aggregation function. On the other hand, 
convergence of estimates for the joint distribution is much easier to 
prove, including CLTs. Beyond Iman-Conover estimates, the results 
of this paper apply to multivariate distributions obtained by plugging 
empirical margins into an exact copula or by plugging exact margins 
into an empirical copula. 
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1 Introduction 

In various real-world applications, multivariate stochastic models are con¬ 
structed upon empirical marginal data and an assumption on the depen¬ 
dence structure between the margins. This dependence assumption is often 
formulated in terms of copulas. The major reason for this set-up is the lack 
of multivariate data sets, as it is often the case in insurance and finance. 
This approach may appear artificial from the statistical point of view, but it 
arises naturally in the context of stress testing. In addition to finance and 
insurance, relevant application areas include engineering and environmental 
studies. Sometimes the marginal data is not even based on observations, 
but is generated by a univariate model that is considered reliable. Many of 
these models are so complex that the resulting distributions cannot be ex¬ 
pressed analytically. In such cases exact marginal distributions are replaced 
by empirical distributions of simulated univariate samples. These empirical 
margins are endowed with some dependence structure to obtain a multivari¬ 
ate distribution. The computation of aggregated risk or other characteristics 
of this multivariate model is typically based on Monte-Carlo techniques. 

Iman—Conover: dependence “injection” by sample re¬ 
ordering 

Related methods include generation of synthetic multivariate samples from 
univariate data sets. Whilst the margins of such a synthetic sample accord 
with the univariate data, its dependence structure is modified to fit the ap¬ 
plication’s needs. The most basic example is the classic Latin Hypercube 
Sampling method, which mimics independent margins. It is a popular tool 
for removing spurious correlations from multivariate data sets. This method 
is also applied to variance reduction in the simulation of independent ran¬ 
dom variables (cf. McKay et ah, 1979; Stein, 1987; Owen, 1992; Iman, 2008). 
Similar applications to dependent random variables include variance reduc¬ 
tion in Monte-Carlo methods (Packham and Schmidt, 2010) and in copula 
estimation (Genest and Segers, 2010). 

An extension of Latin Hypercube Sampling that brings dependence into 
the samples was proposed by Iman and Conover (1982). The original de¬ 
scription of the Iman-Conover method uses random reordering of marginal 
samples, and the intention there was to control the rank correlations in the 
synthetic multivariate sample. The reordering is performed according to the 
vectors of marginal ranks in an i.i.d. sample of some multivariate distribu¬ 
tion, say, H, with continuous margins. Thus rank correlations of H are “in¬ 
jected” into the synthetic sample. This procedure is equivalent to plugging 
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empirical margins (obtained from asynchronous observations) into the rank 
based empirical copula of a sample of H (Arbenz et ah, 2012). Moreover, 
it turned out that the Iman-Conover method allows to introduce not only 
the rank correlations of H into the synthetic samples, but the entire copula 
of H (cf. Arbenz et ah, 2012; Mildenhall, 2005). In somewhat weaker sense, 
these results are related to the approximation of stochastic dependence by 
deterministic functions and to the pioneering result by Kimeldorf and Samp¬ 
son (1978). Further developments in that area include measure preserving 
transformations (Vitale, 1990) and shuffles of min (Durante et ah, 2009). In 
statistical optimization, reordering techniques were also used by Ruschendorf 
(1983). A very recent, related application in quantitative risk management 
is a rearrangement algorithm that computes worst-case bounds for the ag¬ 
gregated loss quantiles in a portfolio with given marginal distributions (cf. 
Embrechts et ah, 2013, and references therein). 

Using explicit reorderings of univariate marginal samples, the Iman- 
Conover method has a unique algorithmic tractability. It is implemented 
in various software packages, and it serves as a standard tool in dependence 
modelling and uncertainty analysis. The reordering algorithm allows even 
to construct synthetic samples with hierarchical dependence structures that 
meet the needs of risk aggregation in insurance and reinsurance companies 
(Arbenz et al., 2012). The distribution of the aggregated risk is estimated by 
the empirical distribution of the component sums -f.. of the syn¬ 

thetic samples X^^^ = (vj^\ ..., X^^^) for k = 1,..., n. This Monte-Carlo 
approach has computational advantages. The resulting convergence rate of 
(or even faster with Quasi-Monte-Carlo using special sequences) allows 
to outperform explicit calculation of sum distributions already for moderate 
dimensions d > 4 (cf. Arbenz et ah, 2011). 

Challenge and contribution: convergence proofs 

Despite its popularity, some applications of the Iman-Conover method have 
been justified by simulations rather than by mathematical proofs. The 
original publication (Iman and Conover, 1982) derives its conclusions from 
promising simulation results for the distribution of the following function of 
a 4-dimensional random vector: /(Xi,..., X 4 ) = Xi -|- X 2 (X 3 — log |Xi|) -|- 
exp(X 4 / 4 ). Yet a rigorous proof is still missing. The present paper provides 
a convergence proof for Iman-Conover estimates of the component sum dis¬ 
tribution. It also includes a proof sketch for the much simpler case of the 
estimated joint distribution. Both problems have been open until now. 

The solutions given in this paper are derived from the empirical process 
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theory as presented in van der Vaart and Wellner (1996). Under appropri¬ 
ate regularity assumptions, Iman-Conover estimates of the sum distribution 
are strongly uniformly consistent with convergence rate (see The¬ 

orems 4.1 and 4.2). The convergence of Iman-Conover estimates for the 
joint distribution is discussed in cf. Remark 4.8. All these findings are not 
specific to the component sum and extend immediately to all component¬ 
wise non-decreasing functions (see Corollary 4.10). Moreover, Theorems 4.1 
and 4.2 also cover the convergence of aggregated risk distributions obtained 
by Monte-Carlo sampling of a multivariate model constructed by plugging 
empirical margins into a copula (see Remark 4.9). In fact, both sampling 
methods (reordering by Iman-Conover and classic top-down sampling with 
empirical margins instead of the exact ones) lead to the same mathematical 
problem. This is discussed in Remark 3.2(d). 

The regularity assumptions used here to establish the Op(n“^/^) conver¬ 
gence rate for Iman-Conover estimates of sum distributions are satisfied for 
all copulas with bounded densities. This case includes the independence 
copula in arbitrary dimension d > 2. The assumptions are also satisfied for 
all bivariate Clayton copulas and for bivariate Gauss copulas with correla¬ 
tion parameter p > 0. The convergence rate for p < 0 is, if at all, only 
slightly weaker. The best bound that is currently available for p < 0 is 
Op(n“^/^^logn). 

The regularity assumptions for the marginal distributions involved in the 
Iman-Conover method are absolutely natural, and they are always satisfied 
by empirical distribution of i.i.d. samples: Strong uniform consistency of 
Iman-Conover estimates needs strong uniform consistency of consistency of 
empirical margins, whereas the uniform Op(n“^/^) convergence rate of Iman- 
Conover requires the same uniform convergence rate of Op(n“^Aj in the 
margins. 

Why a precise CLT remains elusive 

The convergence results obtained here are related to standard convergence 
results for empirical copulas (cf. Riischendorf, 1976; Deheuvels, 1979; Ferma- 
nian et al., 2004; Segers, 2012). However, the mathematical problem for the 
sum distribution goes beyond the standard setting, where empirical measures 
are evaluated on rectangular sets. In the case of sum distributions, the usage 
of empirical margins in the construction of the multivariate model signifi¬ 
cantly extends the class of sets on which the empirical process of the copula 
sample should converge. As shown in Section 3, the canonical way to prove 
asymptotic normality for the Iman-Conover estimator of the sum distribution 
would need a uniform CLT for the copula sample on the collection of so-called 
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lower layers in [0,1]“^. However, this class is too complex for a uniform CLT 
(cf. Dudley, 1999, Theorems 8.3.2, 12.4.1, and 12.4.2). For this reason the 
proofs of consistency and convergence rate presented here sacrifice the precise 
asymptotic variance and use approximations that allow to simplify the prob¬ 
lem. This technical difficulty is not specific to the Iman-Conover method. 
It also arises in any other application where multivariate samples are gener¬ 
ated from a simulated copula sample and empirical marginal distributions. 
As mentioned above, this approach is very popular in practice, especially 
for computational reasons. Similar problems also arise in applications that 
combine exact marginal distributions with empirical copulas. 

Structure of the paper 

The paper is organized as follows. Section 2 introduces the reordering method 
and highlights the relations between sample reordering and empirical copulas. 
The complexity issues are discussed in Section 3. The convergence results 
are established in Section 4. The underlying regularity assumptions are dis¬ 
cussed in Section 5, including examples of copula families that satisfy them. 
Conclusions are stated in Section 6. 


2 Empirical copulas and sample reordering 

Let X = {Xi,...,Xd) be a random vector in with joint distribution 
function F, marginal distribution functions Fi,, F^, and copula C. That 
is. 


F{x) = C{F,{xi),...,Fd{xd)), ( 1 ) 

where C is a probability distribution function on [0,1]'^ with uniform margins. 
By Sklar’s Theorem, any multivariate distribution function F admits this 
representation. 

We assume throughout the following that F) are unknown and that we 
have some uniform approximations Fi^n, i = 1,... ,d. The true margins F) 
need not be continuous. In this case the representation (1) is not unique, 
but it is not an issue in our application, which is rather computational than 
statistical. As sketched in the Introduction, we consider the case where only 
univariate, asynchronous observations of the components Xi are available, 
and the copula C is set by expert judgement to compute the resulting distri¬ 
bution of the component sum. In practice, the choice of the copula C aims 
at dependence characteristics that are known or assumed for the random 
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vector X. This choice also depends on the ability to sample C or any other 
multivariate distribution with continuous margins and copula C. 

To keep the presentation simple, we assume that Fi^n are empirical distri¬ 
bution functions of some univariate samples xl^\ ...,for i = 1,... ,d: 

^ k=l 


These samples need not be i.i.d. We will only assume that \\Fi^n — -^iiloo —t 0, 
either P-a.s. or in probability. Extensions to the general case will be given 
later on. 

Let < • • • < denote the order statistics of the Tth compo¬ 

nent Xi for i = 1,... ,d, and let Ph denote the probability measure with 
distribution function H. The Iman-Conover method approximates Pi? by 
the empirical measure of the following synthetic multivariate sample: 


:= 




j = l,...,n, 


(3) 


where rI^\ ...,for i = 1,... ,d are the marginal ranks of a simulated 
i.i.d. sample ..., ~ C: 

n 

Ap = * = !,...,d, j = l,...,n. 

k=l 

It is easy to verify (cf. Arbenz et al., 2012, Theorem 3.2) that the empirical 
distribution function of the synthetic sample (3) is equal to 

F:{x)-.= CI{F,A^,),...,F,A^,)\ (4) 

where C* is the rank based empirical copula of ..., 

^ ^ 1 I ^Rf^ < Ml, • • •, ^Rf < uA . (5) 

^ k=l ^ 

This links the convergence of the Iman-Conover method to the convergence 
of Pi?* to Pi?, and hence to the convergence of Pc* to Pc- 

Remark 2.1. (a) The central application of the Iman-Conover method dis¬ 
cussed in the present paper is the computation of the aggregated risk 
distribution. The most common risk aggregation function is the sum. In 
this case one must compute the probability distribution of the random 
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variable ■ ■ ■,~ F with F defined in (1). Iman- 

Conover involves two approximations: replacing the unknown margins 
Fi by their empirical versions Fi^n, and replacing the known (or treated 
as known) copula C by its empirical version C*. 

(b) Using U* may appear unnecessary because one can also compute the 
sum distribution for a random vector with margins Fi^n and exact cop¬ 
ula C. However, computation of sum distributions from margins and 
copulas is quite difficult in practice. It involves numeric integration on 
non-rectangular sets, which cannot be reduced to taking the value of 
C (Fi^„(a;i),..., Fd^n{xn)) for a few points x = {xi,, Xd). Implementa¬ 
tions of this kind are exposed to the curse of dimensions. Monte-Carlo 
methods, which Iman-Conover belongs to, have the convergence rate 
of l/\/n, and Quasi-Monte-Carlo methods using special sequences may 
even allow to achieve the rate 1/n. According to Arbenz et ah (2011), ex¬ 
plicit computation of sum distributions is outperformed by Monte-Carlo 
already for <7 = 4. 

(c) Another motivation of the Iman-Conover method is its flexibility and al¬ 
gorithmic tractability. It only includes reordering of samples and works 
in the same way for any dimension. Moreover, sample reordering is com¬ 
patible with hierarchical dependence structures that can be described 
as trees with univariate distributions in leaves and copulas in branching 
nodes (cf. Arbenz et ah, 2012). In each branching node, the marginal 
distributions are aggregated according to the node’s copula and the re¬ 
sulting aggregated (typically, sum) distribution is propagated to the next 
aggregation level. As shown in Arbenz et ah ( 2012 ), sample reordering 
can be implemented for a whole tree. The setting with one copula and 
margins discussed in the present paper is the basic element of such aggre¬ 
gation trees. The results presented here allow to prove the convergence 
of the aggregated (say, sum) distribution in every tree node, including 
the total sum. 

Now let us return to the technical details of the Iman-Conover estimator 
for the aggregated sum distribution. As the random variables are con¬ 
tinuously distributed, they have no ties P-a.s. Thus Pc* consists P-a.s. of n 
atoms of size 1/n. Moreover, these n atoms build a Latin hypercube on the 
d-variate grid f,..., 1}"*, i.e., each section {a: e {^, f,..., 1}'' : a:* = ^} 
for i = 1,..., d and j = 1,... ,n contains precisely one atom. Therefore 
the Iman-Conover method is also called Latin Hypercube Sampling with 
dependence. 

For X ~ F, let G denote the distribution function of the component sum: 
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G{t) := P(Xi + ... + Xrf < t), t G M. The relation between G and Pc can 
be expressed as follows. 

Lemma 2.2. 


WteR G{t)=Pc{{T{A,))) ( 6 ) 

where At := {x G : Yn^iXi < t}, T{x) := (Fi(xi),... ,Fd{xd)), and 
(B) ■= UueB[0,u\ /or 5 C [0,1]'^. 

The notation [0 ,m] refers to the closed d-dimensional interval between 0 
and u: [0, u] := [0, Ui] x ■ ■ ■ x [0, Ud]- 

Proof. Let U G and denote 

T-(t.):=(Fr frw) 

for u G [0,1]'^, where F^{y) := inf{f G M : Fi(t) > y} is the quantile function 
of Fi. It is well known that T'^{U) ~ F. Hence 

G{t) = P^(Hi) = P (T-(t/) G At) = Pc ({m e [0,1]" : T^{u) G A}) , 

and it suffices to show that v G {T{At)) is equivalent to T*^{y) G At. 

If T^(r) G A, then ToT^(n) g (T(A)). Due to FtoFtivt) > vt for all 
i this implies that n G [0,T ° T'^{v)] C (T(A))- 

If r G {T{At)), then v < T{x) (componentwise) for some x G A- Since 
FAoF(xj) < Xi for all this yields T‘*“(n) < x. As the function x h->• 
is componentwise non-decreasing, we obtain that T^(n) G At. □ 

Remark 2.3. (a) The measurability of {T{At)) follows from the equivalence 
of r G {TiAt)) and T^{v) G At. 

(b) The purpose of the operator (■) is to guarantee that for u G {B) and 
V G [0,1]'^ the componentwise ordering v < u implies v G (F). This 
immediately yields ((F)) = (F). Consistently with Dudley (1999), we 
will call (F) the lower layer of F. This set class is also mentioned in 
the context of nonparametric regression (cf. Wright, 1981, and references 
therein). 

(c) The sets {T{At)) are closed if the marginal distributions Fj have bounded 
domains, but not necessarily in the general case. If, for instance, Fi = F 2 

are standard normal distributions, then (T(Ao)) = {m G [0,1]^ ■. ui + U 2 < 1}\ 
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{(0,1), (1,0)}. This example with punctured corners is quite proto¬ 
typical. It is easy to show that if a sequence in {T{At)) con¬ 
verges to u ^ {T{At)), then u is on the boundary of [0,1]'^. Indeed, 
if M G (0, l)*^, then G (0,1)“^ for sufficiently large n. This allows 
to construct a sequence —>■ u such that and < u 

for all n. For any ^ (T{At)) there exists G At such that 
2 "(^(n)) > ^{n)^ jg non-decreasing and o F{xi) < Xi for all f, 

we have and hence G At for all n. 

Since all F^ are left continuous on (0,1), we obtain —)■ T^{u) 

and T^{u) G At. As Ft o F^{ui) > Ui for all we obtain u G {T(At)). 
Thus u ^ {T{At)) is only possible for u G cl[0, l]'^. 

By construction, {T{At)) includes all points u G 5[0,1]'^ such that u + 
eci G {T{At)) for some unit vector e*, f = 1,..., d, and £ > 0. Thus the 
area where the set {T{At)) does not include its boundary points is very 

small. 

Let us now return to the estimation of the sum distribution G{t) = 
PpiAt). The empirical distribution of the component sum in the synthetic 
sample (3) is nothing else than the empirical multivariate distribution of this 
sample evaluated at the sets Ap 


G:(t) := - 


n f 

k=l 




(7) 


k=l ^ ^ 


, 77 , 




G Ai 


Analogously to (6), G*(t) = PF*{At) can be written in terms of the empir¬ 
ical copula C* defined in (5) and, as next step, in terms of the empirical 
distribution Cn of the i.i.d. copula sample ..., 

1 

^ k=l 

Let Ci^n denote the margins of and let denote the corresponding 
quantile functions. To avoid technicalities, we consider CF as mappings 
from [0,1] to [0,1]: 


■= inf{v G [0,1] : Ci^n{v) > m}, ue [0,1]. 

Denote Tn{x) := (Fi,„(a;i),..., and T„ := o r^, where := 

(C^(a;i),..., C^^(a;,)). Then we can state the following result. 
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Corollary 2.4. 


Vt G M 

and, with probability 1, 

G:(t) = Pc*((r„(A))) 

(8) 

Vf G M. 

G:(t) = Pc„((T„(A))). 

(9) 


Proof. It is easy to see that the synthetic sample (3) can be written as 

= k = l,...,n, 

where r^(a;) := , F^^{xd)) and pn{x) := {Ci^n{xi),... ,Cd,n{xd))- 

This yields 

n 

o:(t) = -j2i{Troppu^'->)eA}. (10) 

" k=l 

According to the proof of Lemma 2.2, rfdix) G At is equivalent to x G 
{Tn{At)). Hence (10) implies 

1 

G:(t) = -'£i{p„{u'-’‘y)e{T„A.)}, 

" k=l 

which is the same as ( 8 ) because Cf is the empirical distribution function of 
p„(f/W),...,p„(17W). 

Being continuously distributed, ..., have different values P-a.s. 
for each i. Hence the mapping pfp is componentwise P-a.s. strictly increasing 
on {^,..., 1 }'^ with probability 1 , and therefore 

1 {pn e (t„(A,)>} = i{pto P„ (t/W) e {pr((’-n(A))))} P-a.s. 

Thus (9) follows from p^ o pn{U^^^) = and (p) 7 ((r„(At)))) = o 
rn{At)). □ 

Remark 2.5. Uniform consistency of Fi^n and implies T„ —)■ T in 

3 Complexity of the problem 

The representation (9) translates the asymptotic normality of G* into a CLT 
for Cn uniformly on the random set sequence ((T„(Ai)) : n G N). The 
canonical way to prove results of this kind is to establish a uniform CLT on 
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the set class Td of all possible {Tn{At)) and {T{At)). This is the natural set 
class to work with if Fj are unknown and estimated empirically. The index d 
in the notation Td highlights the dimension. Since Fi need not be continuous, 
the set of all T includes all T„, so that Td is simply the collection of all possible 
{T{At)). Furthermore, if each unknown margin Fj has a positive density on 
entire M, then the resulting empirical distributions Fj can take any value 
in the class of all possible stair functions on M with steps of size 1/n going 
from 0 to 1. In this case the class of all possible {Tn{At)) is dense (w.r.t. 
Hausdorff metric) in the class of all possible T{At). Thus, even though there 
is only one limit transformation T that really matters to us, considering the 
class of all possible {T{At)) does not add more complexity to the problem. 

It is also easy to see that even pointwise asymptotic normality of G* (t) 
in some t = to would require a uniform CLT on Td- Shifting the unknown 
margins F*, one can easily generate all possible sets T{At) from a single At^. 
Thus the complexity of the problem is the same for the uniform and for the 
pointwise asymptotic normality of G*. 

There are various functional CLTs for empirical copulas (cf. Riischendorf, 
1976; Deheuvels, 1979; Fermanian et ah, 2004; Segers, 2012, and references 
therein). However, empirical copula estimates are empirical measures eval¬ 
uated on the set class TZd of rectangle cells (—cxd, a] := (—cxd, oi] x ... x 
(—cxD, a^] for a G M'^. This set class is simple enough to be universally 
Donsker. A set class C is called P-Donsker if the empirical measure Pn(F) := 
^ Ylk=i of an i.i.d. sample ..., ~ p satisfies 

V^(P„(F)-P(F)) AGp(F) (11) 

as a mapping in 1°°{C), where Gp is the so-called Brownian bridge “with 
time” P. That is, Gp is a centred Gaussian process with index B & C and 
covariance structure 

cov(Gp(A), Gp(F)) = P{A n F) - P(A)P(F). 

The Donsker property of C is called universal if it holds for any probability 
measure P on the sample space. 

The symbol in (11) refers to the extended notion of weak convergence 
for non-measurable mappings in 1°°{C) as used in van der Vaart and Wellner 
(1996). See Remark 4.5 for further details. 

Sufficient conditions for a set class to be Donsker can be obtained from 
the entropy of this set class. Entropy conditions can be formulated in terms 
of covering numbers or bracketing numbers (cf. van der Vaart and Wellner, 
1996, Sections 2.1 and 2.2). Entropy bounds that do not depend on the un¬ 
derlying probability measure are called uniform. The most common sufficient 
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criterion for uniform entropy bounds guaranteeing that a set class is univer¬ 
sally Donsker is the Vapnik-Cervonenkis (VC) property. A set class C is VC 
if it does not shatter any n-point set for sufficiently large n. 

The set ..., xV')} is shattered by C if every subset A C ..., xV')} 
can be obtained as A = i? fl ..., xV')^ with some B E C. The smallest 
n such that no n-point set is shattered by C is called VC-index of C. 

It is well known that the set class TZd is VC with index d -|- 1 (cf. van der 
Vaart and Wellner, 1996, Example 2.6.1). This yields asymptotic normality 
of Cn{u) uniformly in n G [0,1]'^. The asymptotic normality of empirical 
copulas follows then by functional Delta method. These functional CLTs 
allow to prove asymptotic normality for the estimators of the multivariate 
distribution function F{x) that are derived from Cn or C*. 

Unfortunately, the problem for G* is much more difficult. As shown 
above, a functional CLT for G* is closely related to a uniform CLT for G„ 
on the set class Td- The complexity of Td is much higher than that of TZd- 
Lemma 3.1 stated below implies that Td is not VC, and Remark 3.2(c) shows 
that the complexity of this set class is even so high that a uniform CLT on 
Td does not hold. 

Let Td denote the collection of all lower layers in [0,1]'^: 

Td := {SC[0,1]‘':(R) = R}, 

and denote Td \= {B ■. B E Td}- Analogously, denote Td ■= {B : B E Td} ■ 
According to Remark 2.3(c), B E Td implies that B \ B G c?[0,1]^, which is 
a Pc-null set for any copula G. Thus, for empirical processes constructed 
from copula samples, uniform convergence on Td is equivalent to uniform 
convergence on Td- 

It is obvious that Td C Td- The following result shows that for d = 2 
these set classes are almost identical. 

Lemma 3 . 1 . If B E T2, then RUA 2 E T2, where A 2 := {u E [0,1]^ : U1U2 = 0} 
is the union of lower faces 0 / [ 0 , 1 ]^. 

Proof. Denote B' := B U A2. It suffices to find probability distribution func¬ 
tions Fi, F2 such that {T(Ao)) = with T{x) = {Fi{xi), F2{x2)). Denote 


Fi{t) fl[o,i)(t) -|- l[i,oo)(^) 


and 


F2{t) ■- = 


0 

sup{s e [0, 1 ] : (— f, s) E B'} 
1 


if f < —1 
if f G [—1, 0) . 
if f > 0 
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Since B' G 772, the function F 2 is non-decreasing in t. Indeed, if (—f, s) G B\ 

then (—f — 5,s) G B' for any 6 G (0,f], and hence ^ 2(7 + 6) > F 2 {t). The 

maximal value of F 2 is 1, and F 2 is right continuous because B' is closed. 

Thus F 2 is a probability distribution function. 

As Fi and F 2 are non-decreasing, we have 

(r(Ao)) = {r(Mo)) 

where OAq := t G M} is the boundary of Aq. Now observe that 

T{dAo) = {(Fi(t),F2(-f)) : t G M} = {(t,F2(-f)) : t G [0,1]}. 

Hence {T(Ao)) is the area enclosed between the zero line and the graph of 

F 2 {—t) for t G [0,1]. This is precisely B'. □ 

Remark 3.2. (a) Since A 2 is a Pc-null set for any copula C, the modifica¬ 
tion of B into i? U A 2 in Lemma 3.1 has no influence on the uniform 
convergence of empirical processes obtained from copula samples. 

(b) The set classes 77^ and Rid are not VC. For instance, they shatter all 
sets {m G (0, ^,..., 1}'^ : Ml -I- ... -I- Md = 1} for n G N. Any subset B of 
this hyperplane in {0, 7,, l}'^ can be picked out by {B) G 77^ C 77^. 
Similar arguments apply to the modified set class {B U A^ : i? G 77^}. 
Hence Lemma 3.1 implies that T 2 is not VC. This rules out the canonical 
usage of VC criteria in convergence proofs for G*. 

(c) The problem is even more difficult, and also more remarkable. In fact, 
the set classes Rid and Rid for <7 > 2 are not Donsker with respect to the 
Lebesgue measure on [0,1]“^ (cf. Dudley, 1999, Theorems 8.3.2, 12.4.1, 
and 12.4.2). Thus Lemma 3.1 implies that a uniform CLT on T 2 does 
not hold in the most basic case, when C is the independence copula. 
Therefore one cannot prove asymptotic normality of G* via uniform CLT 
on 72, and precise asymptotic variance of G* also seems out of reach. 

(d) The complexity issues are not specific to the estimator G* obtained by 

plugging empirical margins Fi^n into the rank based empirical copula G*. 
They also affect models generated by plugging Fi^n directly into the “ex¬ 
act” copula G. Top-down simulation of such models means marginal 
transformation of copula samples ..., ~ G by F^. The result¬ 

ing estimate of the component sum distribution G(7) can be written as 
Pc„(('rn(A))) where Tn{x) := (Fi,„(a;i),.. .,Fd,nixi)). The sets (rn(Ai)) 
feature the same stair shape as the sets {Tn{At)). Asymptotic normality 
of Pcn{{xn{,At))) leads us again to a uniform CLT on Td- 
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(e) It is not yet clear whether Lemma 3.1 can be extended to 5 G Hd for 
d > 2. However, the complexity of 'Hd can only increase for greater d. It 
is easy to embed 772 in 'Hd for d > 2 by identifying H 2 with the following 
subclass: 


77^11,2 {BeHd-.B = B'x [0, l]''-^, 5' G 772} . 

Setting Fiit) := l[o,oo)(7) for 7 = 3,..., d in the proof of Lemma 3.1, the 
result obtained there can be extended to 77 U G 7^ for i? G 77 rf|i, 2 - 
This allows to extend the conclusions in (b,c,d) to all dimensions d > 2. 

Remark 3.3. (a) The results of this section can be summarized as follows: 
The true target set class 77 t := {(7"(A)) : 7 G M} is simple (it will be 
shown in Remark 4.12 that 77 t is VC with index 2), but unknown. Re¬ 
placing these unknown margins by the empirical ones, we obtain random 
elements of the set class 7d, which is too complex for a uniform CLT. 
This is the reason why the convergence proofs presented below sacrifice 
precise asymptotic variance. The resulting loss of precision can be con¬ 
sidered as the price one is forced to be pay for using empirical margins 
Fi^n instead of the true ones. 

(b) Similar issues can also arise when the margins are known, but the copula 
is not. The implicit use of transformations pn in C* (cf. proof of Corol¬ 
lary 2.4) entails deformations of target sets B G Ht that are very similar 
to the ones caused by T„ or Tn- Thus, depending on the application, loss 
of precision may also be caused by the use of an empirical copula. In 
particular, a uniform CLT for C* on the set class Ht is still an open 
problem, and the foregoing results suggest a plausible explanation why 
this problem is so hard. 

(c) A deeper reason behind these complexity issues is the typical shape of 
the target sets B G Ht- If the margins or the copula are estimated 
empirically, corresponding random transformations of B {Tn and in 
Corollary 2.4) significantly increase the complexity of the problem. De¬ 
pending on the application, the resulting loss of precision can be at¬ 
tributed to empirical margins, empirical copulas, or both. 

4 Convergence results 

The major problem studied in this paper is the uniform convergence of the 

Iman-Conover estimator G* introduced in (7). Strong consistency in /°°(M) is 

established in Theorem 4.1. A sufficient condition for the convergence rate to 
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be is given in Theorem 4.2. These results are stated below and followed 
by some corollaries, remarks, and auxiliary results needed in the proofs. The 
technical proofs of the auxiliary results are provided in Section 4.1. 

Theorem 4.1. Let G{t) := P(Xi + ... + < t) for X ^ F as defined 

in (1), and let G* be the Iman-Conover estimator of G introduced in (7). 
Assume that the functions Fi^n defined in (2) satisfy 

i|Pi,n - Pilloo0 P-a.s., i = l,...,d. (12) 

If the copula G of F is Lebesgue absolutely continuous, then ||G* — G||oo —^ 0 
P-a.s. 

As discussed in Remark 3.2(c), a CLT for G* seems out of reach, so that 
the convergence rate is established as an Op(n“^/^) bound. This notation is 
related to tightness: = Op(l) means that is tight, and Zn = Op(a„) is 

equivalent to = Op(l). In particular, if |Z„| < \Yn\ and Yn Y, then 

= Op(l). 

The regularity assumptions also need some additional notation. In the 
following, let Bt denote the “upper” boundary of {T{At)): 

Bt '■= 1G {T{At)) :V£>0 X -\- {s, ^ (T(At)) I f G M, 

and let UfiBt) denote the closed 5-neighbourhood of Bt in Euclidean distance: 

Us{Bt) := {u G [0, l]*^ : \u — v\ < 6 for some v G Bt] . (13) 

One of the regularity assumptions in Theorem 4.2 specifies the probabil¬ 
ity mass that the copula G assigns to UfiBt). The other one involves the 
Lebesgue density c of G. For e: G (0,1/2), we denote 

K{e) := ess sup {c(-u) : u G [e, 1 — eY] ■ 

The growth of K (e) for e —)• 0 specifies the behaviour of c near the boundary 
of [0,1]'^. 

Theorem 4.2. Let G{t) := P(Xi Xd < t) for X ^ F as defined 

in (1), and let G* be the Iman-Conover estimator of G introduced in (7). 
Assume that the functions Ft^n defined in (2) satisfy 

||Pi,n - Pilloo = Op(n“^/^), i = l,...,d, (14) 

and that the copula G of F is absolutely continuous and satisfies 

snpFc{U 5 {Bt)) = 0{6) (15) 

teR 
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Ul 


Ul 


Figure 1: Set families {Bt : t G M}. Left hand side: Xi ~ A/'(0,1) and 
X 2 ~ A/'(0,1/4). Right hand side: Xi ~ Exp(l) and X 2 ~ Exp(0.7). 

for 5 —)■ 0 and Us{Bt) defined in (13). Further, assume that 

\/\ogK (£2)d£ < cx). (16) 

Then IIG^-GIloo =Op(n-V2), 

Remark 4.3. The shapes of the sets Bt strongly depend on the marginal dis¬ 
tributions Fi. Two examples of set families {Bt : t G M} are given in Figure 1. 
The verification of (15) for some copula families is discussed in Section 5. The 
examples presented there suggest that this condition is non-trivial, and that 
it depends on the interplay between the copula C and the true, unknown 
margins F). 

We proceed with an auxiliary result that gives us an upper bound for the 
volume of UfiBt). It follows from the componentwise monotonicity of the 
transformation T. The general idea behind this result is that the “surface 
area” of Bt is bounded by the sum of its d projections on the lower faces 
of the unit square [0,1]'^, and the “thickness” of Us{Bt) is roughly 26. The 
proof is given in Section 4.1. 

Lemma 4.4. Let A denote the Lebesgue measure on [0,1]'^. Then X{Us{Bt)) < 
2d6. 
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The next lemma provides the Glivenko-Cantelli and Donsker properties 
for two set classes involved in the proofs of Theorems 4.1 and 4.2. The 
Donsker property is defined in (11). A set class C is called F-Glivenko- 
Cantelli if the empirical measure Pn(5) := ^ Yl'k=i of an i.i.d. sam¬ 
ple ..., ~ P satisfies 

||P„ - Pile := sup |P„(5) - P(5)| "4 0 (17) 

B&C 

This notation emphasizes that the convergence also depends on the true 
distribution that is sampled to construct P„. 

Remark 4.5. One technical aspect of (17) and (11) needs an additional com¬ 
ment. These statements regard P„ and P as mappings from the probability 
space (0,4, P) to However, P„ need not be measurable with respect 

to the Borel cx-field on 1°°{C) (cf. Billingsley, 1968, Chapter 18). This issue 
can be solved by extended versions of almost sure and weak convergence as 
presented in van der Vaart and Wellner (1996). In the following, —)■' and 4 
are understood according to that monograph. In case of measurability these 
extended notions coincide with the standard ones. 

Lemma 4.6. (a) The set class 

Ut :={(T(Ai)):teM} 

for a fixed T{x) = (Fi(xi),... ,Fd{xd)) is universally Glivenko-Cantelli 
and Donsker. 

(b) If C is Lebesgue absolutely continuous, then the set class 

Vs,:= {Us{Bt)-.6e[0,6o],teR} 
is Pc-Glivenko-Cantelli for any 5o > 0. 

(c) If C is Lebesgue absolutely continuous and satisfies (16), then is 
Fc-Donsker for any Sq > 0. 

The proof of this auxiliary result is given in Section 4.1. Now we proceed 
with the proofs of Theorems 4.1 and 4.2. 

Proof of Theorem f.l. For the sake of simplicity, we will write yi{B) instead 
of /i((H)) for any measure /i on [0, l]*^. According to (9), we have to show 
that Fc„{Tn{At)) —)■ Fc{T{At)) uniformly in f G M. It is easy to see that 

\Fc„{Tn{At))-Fc{T{A,))\ 

< Pc„ ((4(4)) A (T(4))) + |Pc„(T(4)) - Pc(T(4))|, (18) 
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where A denotes the symmetric difference: A A B := {A \ B) U {B \ A). 
According to Lemma 4.6(a), the set class Bt is Pc-Glivenko-Cantelli. Hence 
the second term in (18) converges to 0 P-a.s. uniformly in t G M. 

Now consider the first term in (18) and denote 

Y^:=\\T4x)-T{x)\\^. (19) 

As the transformations T„ and T are componentwise non-decreasing, Yn 
is a measurable random variable. Furthermore, symmetry arguments give 
us ||G^ - id[o,i]||oo = \\Ci,n - id[o,i]||oo, where id[o,i](M) := m for u G [0,1]. 
Hence (12) and the classic Glivenko-Cantelli theorem for Ci^n yield T„ T 
in This implies that Yn 0. 

It is also easy to see that 

{UA,)) A {T(A)) c Uy,(BX (20) 

where Us{Bt) is the set introduced in (13). Moreover, for any 5 > 0 we have 

Pc„ mB,)) < Pc mB,)) + |Pc.(f/i(B,)) - Pc(f/i(Bi))l ■ (21) 

As Yn 0, it suffices to show that for 5 —)■ 0 both terms on the right hand 
side of (21) vanish with probability 1 uniformly in t. In particular, for the 
second term it suffices to show that for some 5o > 0 

lim sup iFcAMBt)) - Pc{Us{Bt))\ = 0 P-a.s. 
ieIR,<5G[0,(5o] 

This follows from Lemma 4.6(b). 

The first term on the right hand side of (21) vanishes due to the ab¬ 
solute continuity of the copula C. Indeed, let e > 0. Since the density c 
of C is non-negative and J c{u) dX{u) = 1, there exists M > 0 such that 
I{c>M} '^(u) dA(M) < e/2. Then, for d < e/(4dM), Lemma 4.4 yields 

Pc (UsiB,)) < Pc„ (Us(B,) n {c < M}) + I 

< MX{Us{Bt)) + I 

< M2dS +- =e. 

~ 2 

That is, Pc(Us{Bt)) —)■ 0 for 5 —)■ 0 □ 

Proof of Theorem f.2. According to (18) and (20), we have that 

vS|G:(«)-G(i)| 

< v^Pc„ (UySb,)) + v^|Pc„{r(2i,)) - Pc(r(A))l. (22) 

18 

©2015, Elsevier. Licensed under the Creative Commons 
Attribution-NonCommercial-NoDerivatives 4-0 International 
http:// Greativecommons. org/licenses/hy-nc-nd/4- 0/ 



The final publications is now available at doi;10.1016/j.jniva.2015.07.008 


The second term in (22) is Op(l) uniformly in t G M due to Lemma 4.6(a). 

Now consider the first term in (22) and observe that 

= ^(Pc„(W.(B,)) - Pc(W.(B<))) + V^i-PciUrSB,)). (23) 

Applying the classic Donsker Theorem (cf. van der Vaart and Wellner, 1996, 
Theorem 2.5.7) to we obtain that - id[o,i]||oo = \\Ci,n - id[o,i]||oo 

is Op(n“^/^). Hence assumption (14) yields Yn = Op(n“^/^), and assump¬ 
tion (15) implies that the second term in (23) is Op(l). 

Let Zn denote the the first term in (23). As Yn = op(l), we have 

Z„ = l{y„ < 5a}^^iHVc,(UYSB,)) - Bc(Uy,(B,))) + op(l). 

for any 6o > 0. Now assumption (16) and Lemma 4.6(c) imply that Zn is 
weakly convergent, and hence Op(l). □ 

The following corollary allows to replace the empirical marginal distribu¬ 
tions Fi^n in Theorems 4.1 and 4.2 by any other consistent approximations 
of the true, unknown margins T). 

Corollary 4.7. Let Fi^n, i = I, ■ ■ ■ ,d, n E N, be arbitrary distribution func¬ 
tions onR, and letGnit) := Pi?*(At) with Ff{x) := Cf{Fi^n{,xi),..., Fd,n{xd)) ■ 

(a) If Fi^n satisfy (12) and C is absolutely continuous, then ||G* — G||oo —^ 0 
P-a.s. 

(b) If Fi^n satisfy (14) and C satisfies (15) and (16), then ||G* — G||oo = 
Op(n“^/^). 

Proof. Part (a). For each there is an approximation Fi^n • {0, ^,..., 1} 

that minimizes ||F),n~-^i,n||oo- If is obvious that ||Fi,n~-^i,n||oo ^ 1/n. Hence 
the estimator Gn{t) := Gf{{fn{At))) with fn{A) := {Gf^n°Fi,n{xi), ..., Gf^^Fd,n{xd)) 
satisfies the assumptions of Theorem 4.1, and therefore ||G„ — G||oo ^ 0. 
Moreover, 


mt) - Gn{t)\ < PcAiUAt)) A {Tn{A,))) 

< Pc’„(I7||f„_T„ll^+l|T„_T|l^(-Bt))- (24) 

As \\Tn — Tn\\oo ^ 0 and ||T„ — T||oo 0, the term (24) vanishes with 
probability 1 uniformly in f G M analogously to the first term in (21). This 
yields ||G* — G||oo —t 0 . 
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Part (b). If Fi^n satisfy (14), then so do Hence Theorem 4.2 yields 
IIGn — Glloo = Furthermore, (24) implies that 

\Gnit) - cm < \FcAUy„{B,)) - FciUy^{B,))\ + Fc{Uy^iB,)) (25) 

for Yn := ||T„ - T||oo + ||Tti - T\\^. As = Op(n“^/^), assumption (15) 
implies that sup^g^Pc(t7y^(i?t)) = Op{n~^Gy The first term on the right 
hand side of (25) is Op{n~^G^ uniformly in f G M due to Lemma 4.6(c). □ 

Remark 4.8. (a) Compared to G*, the multivariate distribution function 
K {x) obtained by plugging Fi^n into G* is much easier to handle. The 
deeper reason here is that F*{x) can be written as the empirical mea¬ 
sure Fc„ indexed with random elements of the rectangle set class TZd- In 
particular, if are defined according to (2), then, analogously to (9), 
we have 

K(x) = ■■■• Fd,n(xj}) 

= C„(Ct„ o O F^„(xi)) = Pc.({r„(i))). (26) 

As mentioned above, TZ^ is VC, and hence universally Donsker and 
Glivenko-Cantelli. Thus, due to (T„(a:)) G TZd, we can apply standard 
results to F*. 

(b) To prove strong consistency of F*, recall that any copula is a Lipschitz 
function with Lipschitz constant 1 (cf. Nelsen, 2006, Theorem 2.2.4). 
Therefore (26) yields 

||F:-F|U<||G„-G||oo + V„. (27) 

As mentioned below (19), assumption (12) implies that Yn 0. Hence 
||F* — F||oo —)■ 0 P-a.s. due to the classic Glivenko-Cantelli Theorem 
for empirical distribution functions. The extension to general Fi^n is 
analogous to Corollary 4.7(a). 

(c) In the proof of Theorem 4.2 it is shown that assumption (14) entails 
y-n = Op{n~^Gy Hence the Op (n ^/^) convergence rate for F* follows 
from (27) and the classic Donsker Theorem for empirical distribution 
functions. The extension to general Fi^n is analogous to Corollary 4.7(b). 

(d) If Fi^n satisfy a functional CLT, then the functional Delta method yields 
a functional CLT for F*, with precise asymptotic variance - see van der 
Vaart and Wellner (1996, Lemma 3.9.28) and Segers (2012) for further 
details. Unfortunately, this does not imply a functional CLT for G*, as 
G* is obtained by indexing Fc„ with a totally different set class. 
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Remark 4.9. Theorems 4.1 and 4.2, along with all their extensions and corol¬ 
laries, also apply to multivariate models generated by plugging empirical 
margins directly into the copula C. According to Remark 3.2(d), the 
resulting estimator of G{t) can be written as RcSi'^n^At))) with r„(x) = 
..., Fd^ri{xd)). Since G*^ = Pc„((T„(At))), extension of convergence 
results to R is straightforward. A closer look at the proof of 
Corollary 2.4 suggests that same is true for the convergence of Pc„((p^ ° 
T{At))) uniformly in f G M, where T{x) = {Fi{xi ),..., Fd{xd)) and Pn{x) = 
{Gi^n{,xi ),..., Gd,n{,Xd))- This setting corresponds to the combination of exact 
margins with the empirical copula C*. 

The final result in this section generalizes all foregoing results to a broader 
class of aggregation functions. Revising the proofs above, it is easy to see 
that the only property of the component sum used there is that it is compo¬ 
nentwise non-decreasing. This immediately yields the following extension. 

Corollary 4.10. Let a function T : —>■ M satisfy 

^{x)<^{y) if Xi <yi for i = 1,... ,d. 

Then all results stated above for the sum distribution G also hold for the 
distribution function of the aggregated random variable 4'(X). In par¬ 
ticular, the estimator := Pir*({x E : ^(x) < t}) converges P-a.s. 

to Gxii in /°°(]R) under the assumptions of Theorem f.l and has convergence 
rate under the assumptions of Theorem 4.2. 

Remark 4.11. (a) It depends on the aggregation function 4/ whether the gen¬ 
eralization stated above is advantageous. In some special cases even 
stronger results are possible. If, for instance, ^(x) = maxjxi,... ,Xrf}, 
then the convergence of ^ in l°° is related to the uniform convergence 
of the empirical measure Pc^ on the rectangle set class IZd fl [0, l]'^. As 
the latter set class is VC, one can derive a Donsker Theorem for „ 
with a precise asymptotic variance. 

(b) Another remarkable example is the Kendall process, which is obtained 
by taking the joint distribution function F as aggregating function 4/. 
The resulting aggregated distribution function is H(t) := P(F(X) < t). 
Using the notation from above, this means H := G^ for 4/ = F. The 
aggregated distribution function H{t) can be estimated by the empir¬ 
ical distribution := ^0; where are the 

Iman-Conover synthetic variables defined in (3) and Ff is their empiri¬ 
cal distribution function (cf. (4)). If the margins Fj are continuous, then 
F{X) has the same distribution as G{U) for U ^ G. Moreover, F*(X) 
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can always be written as C^{U) for U C. Thus the distribution of 
the process y/n{Hn{t) — H{t)) does not depend on the margins Fj. In 
this case asymptotic normality is also available (cf. van der Vaart and 
Wellner, 2007; Ghoudi and Remillard, 1998; Barbe et ah, 1996). 

(c) In the general case, however, extensions of Theorems 4.1 and 4.2 in¬ 
deed go beyond available convergence results for empirical multivariate 
distribution functions. 

4.1 Proofs of auxiliary results 

Proof of Lemma f.f. Denote 

:={(r(A)) + i(i,...,i)>n|o,i|-' 


and, subsequently, 

- 25e,) n [0,1]", * = 1 ,..., d. 

The notation A + x for 4 C and x G represents a shift of the set A, 
i.e., A + X ■.= {a + X ■. a & A}. Further, denote 



W'lr'’ \ w''. 


(*) 
S,t ) 


i = 1,..., d. 


The boundaries of are Lebesgue null sets, because any (4) for A C [0,1]'^ 
is Lebesgue-boundary-less. Indeed, the construction of (4) guarantees that 
if M G (4) and v G [0,1]'^, then v < u (componentwise) implies v G (4). 
Analogously, if m G [0,1]‘^\ (A) and v G [0, l]*^ with v > u, then v G [0,1]'^\ A. 
This monotonicity property allows to cover the boundary d{A) by 
d-dimensional cubes with edge length e for any e: > 0. The total volume of 
this coverage is 0{e), so that sending e: —)■ 0 we obtain A(c?(A)) = 0. This 
implies that all sets and are Lebesgue-boundary-less. 


It is obvious that Us{Bt) C \ . Moreover, the construction of 

entails that 


\ Kt = U 


Kb 


i=l 


and A(V'/^^) < 26 for all i. This yields XUs^Bf) < 2d5. 


□ 
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Proof of Lemma 4.6. According to van der Vaart and Wellner (1996, The¬ 
orem 2.4.1), a set class C is P-Glivenko-Cantelli if the bracketing number 
A'[](£,C,Li(P)) is finite for any e > 0. The number Li{P)) is 

the minimal amount of so-called e-brackets [V,hP] needed to cover C. An 
e-bracket [V, W] with respect to Ti(P) is a pair of sets satisfying V <ZW and 
P{W \ V) < e. A set class C is covered by brackets [V^, ITj], i = 1 ,..., A^, 
if each A G C satisfies Vi C A C Wi for some i. The criterion cited above 
is stated in terms of function classes, but it easily applies to set classes by 
identifying sets with their indicator functions. 

A sufficient condition for C to be P-Donsker is 

^logA^[] (e,C,L2(P))de < oc (28) 

(cf. van der Vaart and Wellner, 1996, Section 2.5.2). The distance of two sets 
A and B in T 2 (P) is related to their distance in Ti(P) via 

dL2{P){AB) = ||1^ - 1b||l2(p) = ^Lpp)("4,5)- 
Hence the T 2 (P) bracketing entropy condition (28) is equivalent to 

yiogiV[](£2,C,Li(P))d£ < cx). (29) 

Part (a). The set class "Hp is the collection of all {T{At)) for f G M with 
a fixed T. Since the sets At are increasing in t, and T is componentwise 
non-decreasing, we have {T{At)) C {T{As)) for t < s. Consequently, PLt 
can be covered by 0{l/e) brackets of size e with respect to Ti(P') for any 
probability measure P' on [0,1]*^. The brackets [V), H7] can be chosen as 
Vi = {T{Atf)), Wi = [Jt<ti+i{T{At)) with an appropriate finite sequence ti < 

... < tiq. If P'{T{At)) has jumps, then it may be difficult to choose ti such 
that P'iWi \ Vi) = e for all i. In this case we may have P'iWi \ Vi) < e for 
some i, but the total number of brackets is still 0 {l/e). Thus we have 

7V[](£,Ht,Ti(P')) = 0 (1/6), 

and PLt is universally Glivenko-Cantelli. If the maximal bracket size is 
then one needs 0(1/6^) brackets to cover PLt- This is sufficient for (29), and 
hence PLt is universally Donsker. 

Remark 4.12. It is also easy to show that the set class PLt is VC with index 
2 . As the sets {T{At)) are increasing in t, they cannot shatter any two- 
point set. Let G [0, l]*^, and let 0,0 G M be such that for fc = 1,2 

xP) g (T(At)) is equivalent to t > tk. Without loss of generality let ti < ^2- 
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Then 1-Lt cannot pick out and hence 1-Lt is VC. From here, universal 

Glivenko-Cantelli and Donsker properties follow if we verify F'-measurability 
of Ht for any probability measure P' on [0,1]'^ (cf. van der Vaart and Wellner, 
1996, Definition 2.3.3). This can also be done. 

However, since the VC property of remains elusive, the proofs of 
Parts (b) and (c) are based on bracketing entropy. This is the reason why 
the proof of Part (a) presented above is also based on bracketing. It gives a 
short preview of the ideas used below. 

Part (c). Fix £ > 0. As [0, [0, l]*^] is a bracket of size 1 in Li(Pc') covering 
any subset of [0,1]'^, we can assume that £ < 1. For this £ we define 

e 

^ 8dK{e/8d) 

and 

:= (^ 0 - 7 ) + - 

Given an arbitrary to ^ 1^ and ti > to, consider the following sets: 

ir,.,. := U 

K.u-= U 

K.,.. - n 

The bracket W4o,ti] covers all Us{Bt) for 5 G [5o)^i] and t G [to,ti). 

The size of this bracket in Li(Pc) equals 

PcW„,,. \ + Po(ni,. \ r,.,,.). 

Denote Jr, := [r], 1 — r]]'^ for p G [0,1/2), and let A be the Lebesgue measure 
on Jo = [0,1]'^. Then 

Pc(H^,.,<, \ VP;,,.) < Pc {Jo \ J,/m) + mtoM \ W',„,,;)K(ElSd). (30) 

As C* is a copula and has uniform marginal distributions, the first term on 
the right hand side satisfies 

d 

Pc {Jo \ Je/Sd) < 5^Pc ({w e [0,1]'" : Ui i \el8d, 1 - e/8d]}) = e/4. 
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To obtain an upper bound for \ tj, observe that 

\ W',:,.,. C ({f/&(B,.)) \ (Us,[B,,))) U ((l7j.(B,.))„p \ (f/<.(B„)>„p) 

where (i?)up := (1, ■ ■ ■, 1)] is the upper layer of B in [0,1]'^. Moreover, 

analogously to Lemma 4.4, one can obtain that 

A({C/*(B„)) \ < <i(io - ii) < rfT = 

Same bound holds for X{{Usg{Btg))^^p \ {Us^{Btg))g^p). Thus (30) yields 

(31) 

Consequently, as Pc is absolutely continuous, we can choose ti > to such 
that either fi < oo and Pc(147o.ti \ or Pc(ff7o,ti \ < £ and 

ti = oo. If ti < oo, then (31) implies that 

(32) 

Proceeding in the same way as above, we obtain an increasing sequence to < 
ti < t 2 < ■ ■ ■ that eventually terminates at cx). Technical difficulties related 
to possible jumps of Pc(hPct \ Vt^^t) for t > U can be handled analogously 
to the proof of Part (a). A similar construction yields a decreasing sequence 
to > f_i > ... that eventually terminates at —oo. 

We still have to show that the sequence tk is always finite, i.e., that t^ 
indeed assumes ±cx) for some k. Consider the sets Sk := \ 

and S'(, := \ Us^{Bti^). As Sk are disjoint for different fc, we have 

^^Pc(*S'fc) < 1 and, analogously, Xlfc Pc'(*S'(,) < 1. It is obvious that 

St us; = \ (C,(B,J n c/j,(B,.„)). 

Furthermore, monotonicity of T implies that 

C,(B,Jnc.(B,„.)= fl c,(ii,)cv;.,,.„- 

[^k ?^/c + l] 

This immediately yields \ C Sk U S^. Applying (32), we 

obtain that Pc(5'fc) + Pc(*S'(,) > e:/2 if tk and tk+i are finite. As the sum 
EkiPciSk) + Pc(5'(,)) is bounded by 2, the length of the sequence (tk) is 
bounded by dfl/e] +4. Possible discontinuities of Pc(IP/ t \ P)*,,*) may 
increase this number at most [2/e] additional steps (cf. proof of Part (a)). 

Thus we have shown that the set class {Us(Bt) : f G M, <5 G [(5i, 5o]} can be 
covered by 6 [l/e] +4 brackets of size e in Li(Pc). Defining 6k ■= (6k-i — 7 )+ 
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for k = 2,3,..., we reach 0 after [^o/hl = \^5QdK[e/^d)/e\ steps. As the 
arguments above apply to any interval [5^, 5fc_i], we obtain a coverage for the 
set class and hence an upper bound for the bracketing number: 

N^^{e,Vs,,L,{Vc)) = 0{e-^K{e/U)). 

According to (29), we need to verify that 

V^log(A' 4:d)e~^)([e < oo. (33) 

Changing the upper integral bound from cx) to 1 is justified by the fact that 
any set class can be covered by a single bracket of size 1. 

As \/\og(l/e)de < oo and 

^/log{K{e^/4d))e-'^ < ^/logK{e^/4:d) + 20og(l/£), 

the integrability condition (33) follows from the assumption (16). 

Part (b). According to proof of part (c), N[-^{e,VsQ, Li(Pc)) < oo for any 
£. Assumption (16) is needed only to verify (29). □ 



5 Examples 

In this section we discuss the regularity assumptions of Theorems 4.1 and 
4.2. The main results are stated in Propositions 5.2 and 5.3, verifying all 
regularity assumptions for bivariate Clayton copulas and bivariate Gauss 
copulas with correlation parameter p > 0. The case p < 0 is treated in 
Proposition 5.3(c), which guarantees the convergence rate Op{n~^^^y/\ogn). 
This is almost as good as Op(n“^/^). 

We start the discussion with a remark covering the mild integrability 
condition (16) and copulas with bounded densities. 

Remark 5.1. (a) It is easy to see that K{e) = 0(exp(£“^+^)) for e —)■ 0 
with some p > 0 implies (16). In particular, any polynomial bound 
K{e) = 0{e~^) for fc > 0 is sufficient. 

(b) An immediate consequence of Lemma 4.4 is that all copulas with bounded 
densities satisfy all regularity conditions of Theorems 4.1 and 4.2. A 
particularly important copula example with a bounded density is the 
independence copula C{u) = The Iman-Conover method with 

independence copula is a standard tool in applications with empirically 
margins based on real data. It is applied to generate multivariate sam¬ 
ples with margins that are close to independent or to remove spurious 
correlations from multivariate data sets. 
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Unfortunately, many popular copulas, such as Gauss, Clayton, Gumbel, 
or t copulas, have unbounded densities. In particular, a bounded copula den¬ 
sity implies that all tail dependence coefficients are zero. Thus applications 
related to dependence of rare events demand a deeper study of copulas with 
unbounded densities. The present paper provides two bivariate examples: 
the Clayton and the Gauss copula. 

The bivariate Clayton copula with parameter 6* G (0, oo) is defined as 

Ce{ui,U2) = + -l) 

The density cq can be obtained by differentiation: 


ce{ui,U2) = du^du,Ce(ui,U2) 

= {u^^ + uf-iy"~"^\9 + l){uiU2)-^-\ (34) 

The next result states that this copula family satisfies all regularity assump¬ 
tions of Theorem 4.2. 

Proposition 5.2. Any bivariate Clayton copula Ca with 6 G (0, cx)) satis¬ 
fies (15) and (16). 

Proof. To verify (16), it suffices to show that K{e) is polynomial (cf. Re¬ 
mark 5.1(a)). The density is given in (34). It is easy to see that 

— l)“2-i/e < X for u G (0, 1)^. Hence the order of magnitude of K{e) is 
determined by sup„g(g which is clearly polynomial. 

To verify (15), recall that the proof of Lemma 4.4 used the following 
coverage of the set UfiBt)'. 


d 

Us(Bt) c UG’. 

i=l 


Hence, for d = 2, we have 

PceiUsiBfi) < Pceivy) + Pcoivff). (35) 

The arguments that yield 0{S) bounds for Pcg iy^) are symmetric in i = 1, 2, 
so that it suffices to consider . For U2 G (0,1), denote 

Ui = Ml (M 2 ) := sup i Ml : (mi, M 2 ) G \ 


and 


Ui = Ui{u2) := inf \ Mi : (mi,M2) G 


41 ) 
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It is easy to see that 


Pc. (vili = 


1 nui 



ce{ui,U2)(iui(iu2. 


0 J u. 


Moreover, the construction of implies that 

Vu 2 G (0,1) ui(u 2 ) — Ui{u 2 ) < 26 A-a.s 
Partial differentiation of log cg yields 

e + 1 


du, \ogcg{ui,U2) = —7 

Ml + U 2 — 1 

Hence du^ \ogcg{ui,U 2 ) = 0 is equivalent to 


Ui 


, z = l,2. 


{29 + l)uf 

Ui^ + U2^ — 1 


— 0 + 1 , 


and for fixed U 2 G (0,1) the copula density C 6 i(mi,M 2 ) attains its maximum at 


Ml = Ml (M 2 ) := min 


0 + 1 


(m,-^ - 1 ) 


-1/9 


Furthermore, cg{ui,U 2 ) is increasing in mi for mi < m^ and decreasing in mi 
for Ml > Mp Let and denote the corresponding sub-domains of 
( 0 , 1 ) 2 : 

;= {u e (0,1)2 : Ml < Mt} , := {u G (0,1)2 : mi > m^} . 

An exemplary plot of the function u* with resulting sets D^\ is given 
in Figure 2. Note that the function M 2 e->• Mi(m 2 ) is non-decreasing for any 
0 > 0 . 

We will show that I”* I”* -D-^) are bounded by 

26. Denote 

jW ;= vr 2 n and ;= 7 r 2 H , 

where 7 r 2 is the projection on the second coordinate: vr 2 ((Mi,M 2 )) := M 2 . 
Further denote Si^'^ := {m G : Mi = Mj(M 2 )} and := 7 r 2 (S'i^^). It is easy 

to see that H Hence we can write 


Pc, 1 hi n of) = 






Cg{u) dMidM2 + 


r(i) 


Cg{u) dUidU2 

(36) 
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Subdivision of the unit square 


Slicing and shifting 




Figure 2: Bivariate Clayton copula: the curve U 2 {ul{u2),u2) subdividing 
the unit square (left) and the resulting slicing and shifting argument (right). 

Denote v\ := sup{Mi : (mi,M2) G This definition implies that 

Fi(m 2 ) < for all U 2 € \ Moreover, it is easy to see that v\ = 

supm(M 2 ) : U 2 G As ^^(^ 2 ) is non-decreasing, this yields < u\{u 2 ) 

for U 2 G \ This gives us 

Vm 2 G \ 4^^ Ml (M 2 ) <vl< Ml (M 2 ) 

and, as a consequence, C C [0,Mi] x (0,1). If vl < 26, then the 

uniform margins of the copula Cg immediately yield Pc.(h;> n B?) < 26 . 
Hence, without loss of generality, we assume that m^ > 26. 

As ce is increasing in ui on and mi — m^ < 26 A-a.s., we obtain that 

Vm 2 G\/ C 0 (M)dMi< / ce{u)(\.ui A-a.s. (37) 
J Ui Jv^ — 2S 

Moreover, it is easy to see that if M 2 G 4^^ and mi G [m^, m^], then mi > hi —25. 
This yields 

ru* rv* 

Vm 2 G / C 0 (M)dMi< / Cg{u)dui. (38) 

J Ui Jv'^—2S 
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Combining (37), (38), and (36), we obtain that 

Pc, (hi n P’+’) < Pc. (K - 26, v\] X |0,1|) = 26. (39) 

The latter equality is due to the uniform margins of the copula C. 

The proof of (37) formalizes the idea of slicing the set along ui for 

every U2 E \ and shifting each slice x {M2} upwards along ui 

until this slice touches the point (m{, M 2 ) as illustrated in Figure 2. Since cg is 
increasing in mi on the transformed set has a larger probability under 
Pq. 

Using the fact that cg is decreasing in mi on one easily obtains the 
following analogue to (37): 

rui 

Vm 2 e \ / Cg(u)dui< / Cg(u)dui A-a.s., 

J J 

where := inf{Mi : (mi, M 2 ) E fl 71^^}. This result is obtained by slicing 

U/P along Ml for all M 2 G \ and shifting each slice [mi,Mi] x {M 2 } 
downwards along mi until it touches the point (m{,M 2 ), cf. Figure 2. 
Similarly to (38), we have that 

/*iZi /*^*+25 

Vn 2 E /(P / Ce(M)dMi< / Cg(u)dui. 

J J V* 

Hence, analogously to (39), we obtain that 

Pc. (kT n bP') < 2i, 


and, consequently, Pc,(h,'’) < 46. 

/o\ 

As the Clayton copula is symmetric in mi and M 2 , we also have pcm:') < 
4(5. Thus (35) yields Pcg(f5^s(.Bt)) < 85, and condition (15) is satisfied. □ 


The next example is the bivariate Gauss copula Cp, defined as the copula 
of a bivariate normal distribution with correlation parameter p E (—1,1). 
The parameter value p = 0 yields the independence copula, which is the 
uniform distribution on the unit square (0,1)^. In this case all regularity 
conditions are satisfied (cf. Remark 5.1(b)). 

Let $ denote the distribution function of the univariate standard normal 
distribution, and for Mi,M 2 E (0,1) let qi = qiiui) := <F“^(Mj) denote the 


corresponding standard normal quantiles. Further, let S 


1 P 
P 1 


denote 
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the 2x2 correlation matrix corresponding to p. Then the bivariate Gauss 
copula Cp for p 7 ^ 0 can be written as 

Cp{ui,U2) = J j det(S)“^/^ exp dxida: 2 , 

where x is considered as a bivariate column vector and is the transposed 
of X. The copula density Cp can be obtained by differentiation: 


cp{^uiyU2) du2duiCp(^ui^ U 2 ') 

= det(S)-V2exp (j - S"') , 


where g = (gi, g 2 )''^ and / is the identity matrix. 

Proposition 5.3. (a) The bivariate Gauss copula Cp always satisfies (16). 

(b) If p>0, then Cp satisfies (15). 

(c) If p <0, then the estimate G* of the sum distribution C satisfies 

I|G; - GL = Op (n-‘/^yi^) ■ (40) 

Proof. As mentioned above, the case p = 0 is trivial. Hence we assume p 7 ^ 0. 

Part (a) It is easy to verify that {I — S“^)g < Mq^q for any fixed 
p e (—1, 0) U (0,1) with some constant M = M{p) > 0. Hence, for £ —)■ 0, 

K{e) = O (^exp (^M |<h-^(£)|^)) . 

Let 0 denote the standard normal density: 0(t) := (27r)“^/^ exp(—|t^). It is 
obvious that ^{t) = ds < J^^—S(p{s) ds = (fit) for t < —1. This 

yields 


I® ^(^)| < |0 ^(^)| = \/-21og(v/^£) (41) 

for sufficiently small e > 0. Hence we obtain that 

K{e) = O ^exp 2Mlog j j = O , e —)■ 0. 

Thus condition (16) follows from Remark 5.1(a). 

Part (b). The verification of (15) for p > 0 is analogous to Proposition 5.2. 
Due to 


J - 


1 

1 — p2 


-P^ P \ 

P -P^)' 


( 42 ) 
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p = 0.5 p = -0.5 




Figure 3: The curves U 2 e-)■ (m^(m 2 ),M 2 ) for the bivariate Gauss copula with 
correlation p = ±0.5. 


partial differentiation of log Cp in ui yields 


logCp(Mi,M2) 


1 

(1 - p2)</,(gi) 


{-p^qi+ pq2)- 


Hence Cp{u) = 0 is equivalent to 

Ml = Mi(m2 ) := <h ($"^(m2)/p) • (43) 

Moreover, it is easy to see that Cp{u) with p > 0 is increasing in mi if mi < 
u\ and decreasing in ui if Mi > u\. This is illustrated in Figure 3. As 
u\{u2) is increasing, the slicing and shifting argument used in the proof of 
Proposition 5.2 applies here, and we obtain that Pcp(f( 5 *'t^) < 45. Due to the 
symmetry of Cp{u) in mi and M 2 , partial differentiation in M 2 and the slicing 
and shifting method yield Pcp(V) 5 *'^^) < 45. Hence (35) gives ns FcpiUsiBt)) < 
85. 

Part (c). The situation for p < 0 is different. The copula density Cp{u) 
is still increasing in mi for mi < m^ and decreasing in mi for mi > m^, but the 
function m^(m 2 ) is decreasing (cf. Figure 3). Thus (37) does not hold here. 
Instead of shifting each slice [m^, mi] x {M 2 } as in the proof of Proposition 5.2, 
one can shift it until it touches the point (m{,M 2 ). If m{ G [M]^,Mi], no shift is 
needed. That is, we replace the interval [m;^,Mi] by the interval [M]^,Mi] ± A, 
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where 


U\ — Ui if Ml < u\, 

A = A(m2) ■= { if > ul, 

0 else. 


It is easy to see that 


Vm 2 e (0,1) 


rui 


Cpiu) dwi < 


cui+A 


’ Ui+A 


Cp{u) dwi 


Integrating over M 2 , we obtain that 

Pc, (hi) < Pc. (!/•), 

where U* := {m G (0,1)^ : |mi — m ^( m 2 )| < 26}. It is easy to see that 
{ueU* : m*(m 2 ) ^ {26, 1 - 26)} C ([0,45] U [1 - 45,1]) x [0,1]. 
Hence, as Cp is a copula and has uniform margins, we obtain that 
Pc, ({w e U* : Mt(M2) i (25,1 - 25)}) < 85. 

Denote the remaining part of U* by Uq\ 

u; := {m e f/* : Mt(M 2 ) G (25,1 - 25)}. 

As m}(m 2 ) maximizes Cp for fixed M 2 , we have that 

Pc, ((/„•) < a / 

-1(1-25) 

Applying (42) and (43) we obtain that 


Cp{ul{u2),U2) dU2. 


Cp(m}(m 2 ),M 2 ) = \/l -p^exp Q (4) ^(m 2 ))^^ 


Thus we need an upper bound for the integral 


/■“i /X „\ 

1(5) := / exp I - ($"^(m 2 )) ) dM 2 . 

Ju*-^(l-2S) / 


lu*-^{l-2S) 

The substitution M 2 = ‘h(t) yields 


1(5) = 


t3--i(25) 


V2 


dt = 


4>-^(25)|. 
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Finally, applying (41), we obtain that 

/(5) < j^\l-2\og{25y/^) = O log^l) 

for (5 —)■ 0. This implies that 

Pc„ (hi) = O («v7i^) ■ (44) 

Symmetry arguments yield the same order of magnitude for and, as a 
consequence, for VcpiUs^Bt)). 

Obviously, (44) is slightly weaker than 0{6) in assumption (15). Revising 
the proof of Theorem 4.2, we see that (15) is used to guarantee that the 
second term in (23) is Op(l). Hence, replacing (15) by (44) in Theorem 4.2, 
one obtains (40). □ 

Remark 5.4. (a) It is currently an open question whether the weaker result 
of Proposition 5.3(c) reflects the reality or simply arises from the approx¬ 
imations used in the proof. However, it should be noted that the case 
p < 0 is indeed more difficult than p > 0. For p < 0 the curve (mi(m 2 ), ^ 2 ) 
may be much closer to the set Bf, and hence Pcp{Us{Bt)) may be sub¬ 
stantially larger than for p > 0. In particular, if Xi ~ A/'(0,1) and 
X 2 ~A7(0,l/p2), then 

{(til(M 2 ), M 2 ) : U2 e (0,1)} = T ({x e : Xi -f X2 = O}) 

= Ron (0,1)2. 

That is, for p < 0 one can be confronted with the worst case when some 
Bt entirely falls into the area where the copula density is at its largest. 
A graphic example to this issue is given on the left hand side of Figure 1. 
The set Bq in that plot coincides with the set {(m^(m 2 ),M 2 ) : M 2 ^ (0,1)} 
from the right hand side of Figure 3 (p = —0.5). Such coincidence is not 
possible for p > 0 or for the Clayton copula. 

(b) Intuitively speaking, this problem originates from the negative depen¬ 
dence for p < 0, where large values of Xi tend to be associated with 
small values of X 2 and vice versa. As a consequence, the probability 
P(Xi -|- X 2 < 0) is influenced by tail events. This effect is much weaker 
for Gauss copulas with p > 0 and, analogously, for many other copulas 
with positive dependence. 

(c) The margins F) may also influence the convergence of the estimated sum 
distribution function G*. In insurance and reinsurance applications the 


34 


©2015, Elsevier. Licensed under the Creative Commons 
Attribution-NonCommercial-NoDerivatives 4-0 International 
http:// Greativecommons. org/licenses/hy-nc-nd/4- 0/ 







The final publications is now available at doi;10.1016/j.jniva.2015.07.008 


components Xi ~ Fi are often non-negative. In this case Xi + X 2 < t 
implies that Xi < t for i = 1, 2, so that the tail events have no influence 
on G{t) = P(Xi -|- X 2 < t) for moderately large t. The resulting sets 
Bf for f < 00 do not contain any internal points of the unit square 
that are close to the upper left or to the lower right vertex. That is, 
u E Bt and Ui > 1 — e with a small e implies U 2 = 0, and U 2 > I — e 
implies Ui = 0. These Bt avoid the areas where the density of the 
Gauss copula with p < 0 is at its highest. Thus non-negative margins 
simplify the estimation of the function G for the Gauss copula with p < 0 
and, analogously, for many other copulas with negative dependence. An 
illustration to the different types of sets Bt is given in Figure 1. 


6 Conclusions 

This paper proves that Iman-Gonover based estimates for the distribution 
function of the component sum are strongly uniformly consistent, and it pro¬ 
vides sufficient conditions for the convergence rate Besides the 

component sum, these results hold for any other componentwise non-decreas¬ 
ing function. The underlying mathematical problem goes beyond the classic 
uniform convergence results for empirical copulas. In the context of the 
Iman-Gonover method, the primary cause for this technical difficulty is the 
implicit usage of empirical marginal distributions. Similar issues also arise 
in all multivariate models generated by plugging empirical margins into an 
exact copula or by plugging exact marginal distributions into an empirical 
copula. The marginal transformations involved in these applications com¬ 
plicate the resulting estimation problem in a way that does not allow to 
establish asymptotic normality of the estimated sum distribution. Therefore 
the weaker statement is quite the best result one can achieve. 

The results proved here for the Iman-Gonover method extend to all 
models generated by plugging empirical margins into an exact copula or by 
plugging exact margins into an empirical copula. The regularity conditions 
needed for the convergence rate qj-q satisfied for all copulas with 

bounded densities, all bivariate Glayton copulas, and bivariate Gauss copulas 
with correlation parameter p > 0. The best convergence rate that could be 
established for the bivariate Gauss copulas with p < 0 is Op(n“^/^\/logn). 
This result suggests that negative dependence may slow down the conver¬ 
gence of estimates. On the other hand, non-negative components Xj, as 
typical in insurance applications, may simplify the problem. The proof tech¬ 
nique used for the bivariate Glayton copula applies to bivariate Gauss copulas 
with p > 0 and may also work for other bivariate copulas with positive depen- 
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dence. A straightforward generalization of this method to higher dimensions 
is, however, not feasible. Thus the question for the convergence rate in the 
d-variate case with d > 2 and an unbounded copula density is currently open. 
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